# Common Voice dataset

Whisper Small Ta

A speech recognition model based on OpenAI's Whisper Small and fine-tuned on the Tamil Common Voice 17.0 dataset, with a word error rate (WER) of 43.23%.

Speech Recognition

Transformers Other
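
Several entries in this list quote a word error rate (WER), sometimes as a percentage (43.23%) and sometimes as a fraction (0.3634). As a rough illustration of what the metric measures, the sketch below computes WER as the word-level edit distance (substitutions, insertions, deletions) divided by the number of reference words. The function name `word_error_rate` is hypothetical, and the sketch assumes plain whitespace tokenization with no text normalization; the listed models may have been scored with different tooling and preprocessing.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER as a fraction: word-level edit distance / reference length.

    Assumption: words are split on whitespace and compared exactly
    (no casing or punctuation normalization).
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words so far
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    wer = word_error_rate("the cat sat on the mat", "the cat sat on mat")
    # Multiply by 100 to express the fraction as a percentage.
    print(f"WER: {wer:.4f} ({wer * 100:.2f}%)")
```

Because insertions count as errors, WER can exceed 1.0 (100%) when the hypothesis is much longer than the reference.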

Whisper Small Fr

This is a Whisper-small speech recognition model fine-tuned on French datasets, reducing the word error rate by 6.793 percentage points compared to the baseline model.

Speech Recognition

Transformers French

Whisper Base Pl

A speech recognition model fine-tuned on the Polish Common Voice 17.0 dataset based on OpenAI Whisper-base

Speech Recognition

Transformers Other

Whisper Large V3 Cantonese

A Cantonese automatic speech recognition model fine-tuned from Whisper Large V3 on the Common Voice 17 dataset.

Speech Recognition

Transformers Other

Finetuned Whisper Mr

A Whisper small speech recognition model fine-tuned on the Common Voice 17.0 Marathi dataset, based on simran14/mr-model-h

Speech Recognition

Transformers Other

Wav2vec2 Large Xls R 300m Amharic Demo Colab

Amharic speech recognition model fine-tuned on the common_voice_16_1 dataset based on facebook/wav2vec2-xls-r-300m

Speech Recognition

Wav2vec2 Large Xls R 300m Albanian Colab

A speech processing model fine-tuned on the common_voice_albanian dataset based on facebook/wav2vec2-xls-r-300m, suitable for Albanian-related tasks.

Speech Recognition

Wav2vec2 Large Xlsr Mvc Swahili

This model is a fine-tuned version of facebook/wav2vec2-large-xlsr-53, specifically designed for automatic speech recognition tasks in Swahili.

Speech Recognition

Transformers Other

Whisper Small Dv

A Dhivehi (the official language of the Maldives) automatic speech recognition model fine-tuned from OpenAI Whisper-small on the Common Voice 13 dataset.

Speech Recognition

Transformers Other

Whisper Small Fa

A Whisper (small) model fine-tuned by the Hezar team on the Persian portion of the Common Voice dataset for automatic speech recognition tasks.

Speech Recognition Other

A Bengali automatic speech recognition model based on the Whisper small architecture, fine-tuned on approximately 400 hours of Mozilla Common Voice data, with a word error rate of 4.58%.

Speech Recognition

bangla-speech-processing

Whisper Large Persian

Persian automatic speech recognition model based on Whisper architecture, fine-tuned on Common Voice 11.0 Persian dataset

Speech Recognition

Transformers Other

Whisper Large V2 Kazakh

A speech recognition model based on OpenAI's Whisper Large V2, fine-tuned on the Kazakh Common Voice 11.0 dataset.

Speech Recognition

Transformers Other

Whisper Tiny Es

A speech recognition model fine-tuned on Spanish dataset based on OpenAI Whisper-tiny

Speech Recognition

Transformers Spanish

Exp W2v2t Fa Hubert S801

A Persian automatic speech recognition model fine-tuned from facebook/hubert-large-ll60k, trained using the Common Voice 7.0 Persian dataset.

Speech Recognition

Transformers Other

Exp W2v2t Sv Se Wavlm S42

A Swedish automatic speech recognition model fine-tuned from microsoft/wavlm-large, suitable for 16kHz sampled audio input.

Speech Recognition

Exp W2v2t It Wavlm S895

An Italian automatic speech recognition model fine-tuned based on microsoft/wavlm-large, trained using the Common Voice 7.0 Italian dataset.

Speech Recognition

Transformers Other

Exp W2v2t It No Pretraining S842

Fine-tuned from a randomly initialized wav2vec2 model for Italian speech recognition tasks, trained on the training split of Common Voice 7.0 (Italian).

Speech Recognition

Transformers Other

Exp W2v2t It Xlsr 53 S387

An Italian automatic speech recognition model fine-tuned based on the facebook/wav2vec2-large-xlsr-53 model, trained using the Common Voice 7.0 Italian dataset.

Speech Recognition

Transformers Other

Exp W2v2t It Wav2vec2 S609

An Italian automatic speech recognition model fine-tuned based on facebook/wav2vec2-large-lv60, trained using the Common Voice 7.0 Italian dataset.

Speech Recognition

Transformers Other

Exp W2v2t Th Hubert S533

A Thai speech recognition model fine-tuned from facebook/hubert-large-ll60k, trained on data from Common Voice 7.0

Speech Recognition

Transformers Other

Exp W2v2t En Vp Nl S281

An English speech recognition model fine-tuned based on facebook/wav2vec2-large-nl-voxpopuli, trained using the Common Voice 7.0 training set.

Speech Recognition

Transformers English

Wav2vec2 Large Xls R 300m Tamil Colab

This model is a Tamil speech recognition model fine-tuned on the Common Voice dataset based on facebook/wav2vec2-xls-r-300m

Speech Recognition

Model Facebookptbrlarge

A Brazilian Portuguese speech recognition model fine-tuned on the Common Voice dataset based on Facebook's wav2vec2-large-xlsr-53-portuguese model

Speech Recognition

Wav2vec2 Base Common Voice 50p Persian Colab

A speech recognition model fine-tuned from facebook/wav2vec2-base for Persian, supporting Persian speech-to-text tasks.

Speech Recognition

Wav2vec2 Base Common Voice Persian Colab

A speech recognition model fine-tuned from facebook/wav2vec2-base on Persian-language datasets, primarily used for Persian speech-to-text tasks.

Speech Recognition

Wav2vec2 Large Xls R 300m Turkish Colab Common Voice 8 5

This is a Turkish speech recognition model based on the wav2vec2 architecture, fine-tuned on the Common Voice dataset with a word error rate (WER) of 0.3634.

Speech Recognition

Wav2vec2 Xls R 300m Mr Cv9 With Lm

An automatic speech recognition model fine-tuned on Marathi speech datasets based on Facebook's XLS-R-300M model

Speech Recognition

Transformers Other

Wav2vec2 Xls R 300m Ur Cv9 With Lm

This model is an automatic speech recognition (ASR) model fine-tuned on Urdu speech datasets based on facebook/wav2vec2-xls-r-300m

Speech Recognition

Transformers Other

Common Voice Lithuanian Fairseq

A Lithuanian automatic speech recognition model trained on the Common Voice dataset, implemented using the wav2vec2 architecture and fairseq framework.

Speech Recognition

Transformers Other

Wav2vec2 Base Common Voice Fa Demo Colab

This model is a Persian speech recognition model fine-tuned based on facebook/wav2vec2-base, suitable for Persian speech-to-text tasks.

Speech Recognition

An automatic speech recognition system fine-tuned on the Common Voice 8 Belarusian dataset based on facebook/wav2vec2-base model

Speech Recognition

Transformers Other

Wav2vec2 Common Voice Tr Demo Dist

This model is an automatic speech recognition (ASR) model fine-tuned on the Turkish COMMON_VOICE dataset based on facebook/wav2vec2-large-xlsr-53, achieving a word error rate (WER) of 33.05% on the evaluation set.

Speech Recognition

Transformers Other

Automatic speech recognition model fine-tuned on Mozilla Common Voice Portuguese dataset based on facebook/wav2vec2-xls-r-300m

Speech Recognition

Transformers Other

Sinai Voice Ar Stt

An Arabic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on the Common Voice Arabic dataset

Speech Recognition

Transformers Arabic

Wav2vec2 Large Xls R 300m El

This is an automatic speech recognition model fine-tuned on the Greek Common Voice 8 dataset, based on the facebook/wav2vec2-xls-r-300m model.

Speech Recognition

Transformers Other

Wav2vec2 Common Voice Ab Demo

A speech recognition model fine-tuned on the COMMON_VOICE - AB dataset based on facebook/wav2vec2-large-xlsr-53

Speech Recognition

Transformers Other

patrickvonplaten

Wav2vec2 Xlsr Lithuanian

An automatic speech recognition model based on facebook/wav2vec2-xls-r-1b, fine-tuned on a Lithuanian dataset.

Speech Recognition

Transformers Other

Wav2vec2 Common Voice Tr Demo

This model is an automatic speech recognition (ASR) model fine-tuned on the COMMON_VOICE SV-SE dataset based on facebook/wav2vec2-large-xlsr-53, supporting Swedish speech recognition.

Speech Recognition

Wav2vec2 Large Xlsr Kinyarwanda Apostrophied

A fine-tuned model based on facebook/wav2vec2-large-xlsr-53 for Kinyarwanda, capable of predicting apostrophes in marked pronouns and vowel-initial word contractions

Speech Recognition Other

© 2025 AIbase